Incremental Maintenance of Data Warehouses Based on Past Temporal Logic Operators

Authors

  • Sandra de Amo
  • Mirian Halfeld Ferrari Alves
Abstract

We see a temporal data warehouse as a set of temporal views defined in the past fragment of the temporal relational algebra extended with set-valued attributes and aggregation. This paper proposes an incremental maintenance method for temporal views that improves on recomputation from scratch. We introduce a formalism for temporal data warehouse specification that summarizes the information needed for its incremental maintenance. According to this formalism, a temporal data warehouse W is a pair of sets of views: the materialized component and the virtual component. The materialized component of W represents the set of views physically stored in the warehouse. The virtual component of W is a set of non-temporal expressions involving only relations kept in the materialized component. Several features of our approach make it especially attractive as a maintenance method for warehouses: (a) there is no need to store the entire history of the source databases, (b) maintenance of the temporal data warehouse is reduced to maintaining the (non-temporal) materialized component, and (c) the materialized component is self-maintainable. We build a uniform algorithm by combining two previously unrelated techniques based on auxiliary views. Our method is sufficiently general that it can be easily adapted to databases with complex-valued attributes.
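To illustrate claim (a), that the full source history need not be stored, the following is a minimal sketch and not the authors' algorithm. It assumes two past operators over a single relation R, "once in the past" and "always in the past", both taken to include the current state; that inclusive reading, the class name `PastViews`, and the helper `recompute` are all assumptions made for this example. Each stored view is updated from its previous value and the newest state of R alone, and the result is compared against recomputation from the whole history.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class PastViews:
    """Hypothetical materialized state for two past operators over a relation R."""
    # Tuples that appeared in R at some state so far (assumed to include the current one).
    once: set = field(default_factory=set)
    # Tuples that appeared in R at every state so far; None until the first state arrives.
    always: Optional[set] = None

    def advance(self, current_r: set) -> None:
        """Fold the newest state of R into the stored views, using no older states."""
        self.once |= current_r
        if self.always is None:
            self.always = set(current_r)
        else:
            self.always &= current_r


def recompute(history):
    """Baseline: evaluate both operators from scratch over the whole history."""
    once = set().union(*history)
    always = set.intersection(*map(set, history)) if history else None
    return once, always


history = [{"a", "b"}, {"b", "c"}, {"b"}]
views = PastViews()
for state in history:
    views.advance(state)

# The incrementally maintained views agree with full recomputation.
print((views.once, views.always) == recompute(history))
```

In this sketch the stored sets play the role of a materialized component: they are ordinary non-temporal relations, and updating them needs only their own previous contents plus the current state of R.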

Similar resources

Modélisation et manipulation de données historisées et archivées dans un entrepôt orienté objet

This paper deals with temporal and archive object-oriented data warehouse modelling and querying. In a first step, we define a data model describing warehouses as central repositories of complex and temporal data extracted from one information source. The model is based on the concepts of warehouse object and environment. A warehouse object is composed of one current state, several past states ...


Incremental ETL Pipeline Scheduling for Near Real-Time Data Warehouses

We present our work based on an incremental ETL pipeline for on-demand data warehouse maintenance. Pipeline parallelism is exploited to concurrently execute a chain of maintenance jobs, each of which takes a batch of delta tuples extracted from source-local transactions with commit timestamps preceding the arrival time of an incoming warehouse query and calculates final deltas to bring relevant ...


Evaluation of view maintenance with complex joins in a data warehouse environment

Data warehouse maintenance and maintenance cost has been well studied in the literature. Integrating data sources, in a data warehouse environment, may often need data cleaning, transformation, or any other function applied to the data in order to integrate it. The impact on view maintenance, when data is integrated with other comparison operators than defined in theta join, has, however, not b...


Performance Analysis of WHIPS Incremental Maintenance

Incremental maintenance incorporates new changes automatically and continuously into a data warehouse, and seems to be the best maintenance solution for very large warehouses. However, the performance of incremental maintenance algorithms is not well understood, and commercial incremental maintenance systems are still not widely available. In this paper, we study the performance of WHIPS, a pr...


WPI-CS-TR-00-16, May 2000: Scalable Maintenance in Distributed Data

The maintenance of data warehouses is becoming an increasingly important topic due to the growing use, derivation and integration of digital information. Most previous work has dealt with one centralized data warehouse (DW) only. In this paper, we now focus on environments with multiple data warehouses that are possibly derived from other data warehouses. In such a large-scale environment, data...



Journal:
  • J. UCS

Volume 10  Issue

Pages  -

Publication date 2004